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Abstract 

Weighted caching is a generalization of paging in which the cost to 
evict an item depends on the item. We give two results concerning 
strategies for these problems that incur a cost within a factor of the 
minimum possible on each input. 

We explore the linear programming structure of the more general 
k-server problem. We obtain the surprising insight that the well- 
known "least recently used" and "balance" algorithms are primal- 
dual algorithms. We generalize them both, obtaining a single k _\ +l - 
competitive, primal-dual strategy for weighted caching. 

We introduce loose competitiveness, motivated by Sleator and Tar- 
jan's complaint pT85 that the standard competitive ratios for paging 



strategies are too high. A fc-server strategy is loosely c(k)- competitive 
if, for any sequence, for almost all k, the cost incurred by the strategy 
with k servers eithens no more than c{k) times the minimum cost or is 
insignificant. We show that fc-competitive paging strategies including 
"least recently used" and "first in first out" are loosely c(fc)-competitive 
provided c(fc)/lnfc — > oo. We show that the (2 In fc)-co mpetitive, ran- 
domized "marking algorithm" of Fiat et al. [ FKL + 9l"| is loosely c(k)- 
competitive provided c{k) — 21nlnfc — > oo. 



*This research was performed while the author was at the Computer Science De- 
partment, Princeton University, Princeton, NJ 08544, and was supported by the Hertz 
Foundation. 



The body of this paper consists of four sections. In §1, the introduction, 
we describe background, our results, and related work. In §2, we give our 
weighted caching strategy. In §3, we show loose competitiveness of various 
paging strategies. We conclude with comments about further research in §4. 



1 Introduction 

Many real problems must be solved on-line — decisions that restrict possible 
solutions must be made before the entire problem is known. Generally, one 
can not guarantee an optimal solution if one must solve a problem on-line. 
Thus a natural question for such a problem is whether a strategy exists that 
guarantees an approximately optimal solution. 

In this paper we study the k-server problem [ McG87 , MMS90[| . Vari- 



ous definitions of the problem exist in the literature; we take the following 
definition, which is technically convenient and essentially equivalent to the 
other definitions: One is given a complete directed graph with edge lengths 
d{u,v), a number, k, of identical, mobile servers, and a sequence r of re- 
quests, each to some node. In response to the first request, all servers are 
placed on the requested node. In response to each subsequent request v, if 
no server is on v , some server must be chosen to move from its current node 
u to v at a cost of d(u, v). A strategy for solving the problem is on-line if it 
chooses the server to move independently of later requests. The goal is to 
minimize the total cost. 



As many authors (e.g. Chrobak and Larmore [CL91]) have pointed out, 
the ^-server problem is an abstraction of a number of practical on-line prob- 
lems, including linear search, paging, font caching, and motion planning for 
two-headed disks. 

We focus on two special cases of the /c-server problem: the weighted 
caching problem [MMS9C], in which d(u,v) = w(u) for u ^ v (the cost 



to move a server from a node depends only on the node), and the paging 
problem [ST85], in which the cost to move a server is uniformly 1. 



Traditionally, paging is described as the problem of managing a fast 
memory, or cache, capable of holding k items: items are requested; if a 
requested item is not in the fast memory, it must be placed in the fast 
memory, possibly evicting some other item to make room. The goal is to 
minimize the fault rate — the number of evictions per request. Weighted 
caching is similar, except that the cost to evict an item depends on the item. 

For these two problems, for technical reasons and without loss of gener- 
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ality, we replace the asumption that all servers begin on the first requested 
node with the assumption that initially no servers reside on nodes, and, in 
response to any request, any server that has not yet served a request may 
be placed on the requested node at no cost. 

Following a number of authors (Sleator and Tarjan [ST85]; Borodin, 
Linial, and Saks [ BLS87 |; and Manasse, McGeoch, and Sleator | MMS90fl ), 
we are interested in strategies that are competitive, that is, strategies that 
on any sequence incur a cost bounded by some constant times the minimum 
cost possible for that sequence. Formally, 

• r denotes an arbitrary sequence of requests. 

• X denotes some on-line /c-server strategy. 

• k denotes the number of servers given to the on-line strategy. 

• Opt denotes the (off-line) strategy producing a minimum cost solu- 
tion. 

• h denotes the number of servers given to Opt. 

• C r (X, k) denotes the (expected) cost incurred by the (randomized^) 
strategy X with k servers on request sequence r. 

• A strategy X is c- competitive for a given r, h, and k, when 

C r (X,k) < c-C r {OPT,h) + b, 

where b depends on the initial positions of the optimal and on-line 
servers, but is otherwise independent of r. 

• A strategy X is c(h,k)- competitive when X is c(h, fc)-competitive for 
all r, h and k. C(h, k) is then called a competitive ratio of X. 

Note that the competitiveness of a strategy is unrelated to its computational 
complexity. 

Before we describe our results, here is a summary of the strategies rele- 
vant to our work. 



We implicitly assume that the input re quests are in dependent of the random choices 
made by the strategy; for other models see |[BDBK + 9C ] . 
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• LRU, FIFO, and Fwf are, respectively the "least recently used", "first 
in first out", and "flush when full" paging strategies. Lru moves the 
server from the least recently requested, served node. Fifo, which can 
be obtained from Lru by ignoring served requests, moves the least 
recently moved server. Fwf evicts all items from the fast memory 
(removes all servers from the graph at a cost of k) when the fast 
memory is full and the requested item is not in the fast memory. 

• Mark is the marking algorithm, a randomized paging strategy. Mark 
may be described as follows: if the requested node has no server, mark 
all servers if none are marked, and then move and unmark a marked 
server chosen uniformly at random; if the requested node has a server, 
unmark that server. 

• Balance is the balance algorithm, a /c-server strategy. In response to 
request r, Balance moves the server from served node u minimizing 
d(u,r) + W(u), where W(u) denotes the net distance traveled so far 
by the server on u. Balance generalizes Fifo. 



1.1 A primal-dual strategy for weighted caching 

In §2 we introduce and analyze GreedyDual, a new, primal-dual, de- 
terministic, on-line weighted-caching strategy that is (optimally) 
competitive. Figure |l] contains a direct description of GreedyDual. 

GreedyDual is of practical interest because it generalizes Lru, one of 
the best paging strategies, to weighted caching. GreedyDual also gener- 
alizes BALANCE for weighted caching, and thus FlFO.0 

GreedyDual is of theoretical interest because its analysis is the first 
primal-dual analysis^] of an on-line algorithm and because the analysis, which 
shows an (optimal) competitive ratio of , is the first to show a ratio 

less than k when h < k for any fe-server problem more general than paging. 
A consequence of this reduced ratio is that GreedyDual has a constant 
competitive ratio provided h is any fraction of k. 

2 The natural generalization of Lru for weighted caching can be obtained by ignoring 
the L[-] values and lowering as much as possible in the Relabel step. Balance, as it 
specializes for weighted caching, can be obtained by ignoring the H[-] values and lowering 
as little as possible in the Relabel step. 

3 We will assu me fam iliarity with linear programming primal-dual techniques. For an 



introduction, see |PS82{ 
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GreedyDual 

Maintain a pair of real values L[s] < H[s] with each server s. 
In response to each request, let v be the requested node; 

if some server s is on v then 

let H[s] «- w(v) 
else if some server s has yet to serve a request then 

let L[s] <- #>] <- 

Place son«. 

else 

Relabel: Uniformly lower L[s] and H[s] for all s so that 

min Lis] < < mini/ [si. 

s s 

Move any server s' such that L[s'] < to v. 
let L[s'] <- i?[s'] «- io(t>) 



Figure 1: The weighted caching algorithm GreedyDual 

We feel that the primal-dual approach, well developed for exact opti- 
mization problems, is also important for approximation problems, including 
on-line problems, because primal-dual considerations help reveal combina- 
toric structure, especially how to bound optimal costs. The primal-dual ap- 
proach also has the potential to unify the arguably ad hoc existing on-line 



analyses. For instance, the analyses of Lru and Fifo | ST85 |, of Balance 



for weie hted caching ||CKPV91|| , and of Mark [ |FKL + 91| can all be cast as 



closely related primal-dual analyses. The primal-dual approach can also re- 
veal connections to existing optimization theory. For these reasons, we take 
pains to make explicit the primal-dual framework behind our analysis. 

Here is a sketch of our primal-dual approach. The /c-server problem has 
a natural formulation as an integer linear program (IP) that is essentially 
a minimum-weight matching problem. Relaxing the integrality constraints 
of IP yields a linear program (LP) (which, incidentally, has optimal integer 
solutions). GreedyDual implicitly generates a solution to the dual pro- 
gram (DP) of LP. The dual solution serves two purposes: GreedyDual 
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uses the structural information that the solution provides about the prob- 
lem instance to guide its choices, and GreedyDual uses the cost of the 
dual solution as a lower bound on C r (OPT, h) to certify competitiveness. 
Related work includes the following. Sleator and Tarjan ST85[| show 



that Lru and FIFO are k _ h+1 -competitive, and that this ratio is optimal for 
deterministic, on-line paging strategies. A similar analysis shows that Fwf 
is also -competitive. 

Fiat et al. [ FKL + 91 , |You91a ] introduce and analyze Mark, showing 



that it is 2i/fc-competitive (H^ ~ In A;), and showing that no randomized 
paging strategy is better than ^-competitive when h = k. McGeoch 



and Sleator [MS89] subsequently give a -fffc-competitive randomized paging 
strategy. Young [ You91bj ] shows that Mark is roughly 2 In ^^-competitive 
when h < k and that no randomized strategy is better than roughly In 
competitive. 



Manasse, McGeoch, and Sleator |MMS90(| show that Balance is k- 



competitive for the general problem provided only k + 1 distinct nodes 
are requested, and that no deterministic algorithm is better than 



k-h+l 

competitive in any graph with at least k + 1 distinct nodes. 

Chrobak, Karloff, Payne, and Vishwanathan | |CKPV9l[ show that Balance 



is /^-competitive for weighted caching. Independently of their analysis of 



Balance, Chrobak, Karloff, Payne, and Vishwanathan [CKPV91] formu- 
late the fc-server problem as an integral-capacity minimum-cost maximum- 
flow problem and use this formulation to give a polynomial time algorithm 
to find a minimum-cost solution. 

The primal-dual approach has been used extensively for exact optimiza- 
tion problems [ PS82| 1, and is used implicitly in a number of recent analyses 



of approximation algorithms. Goemans and Williamson [ GW92| 1 explicitly 



use the approach for finding approximate solutions to NP-hard connectivity 
problems. 

1.2 A more realistic A>server model 

In §3 we give the second contribution of this paper: loose competitiveness. 
Loose competitiveness is motivated by Sleator and Tarjan's | ST85[| com- 



plaint that (when h = k) the competitive ratios of paging strategies are 
too high to be of practical interest. We have done simulations that suggest 
that in practice good paging strategies usually incur a cost within a small 
constant factor of minimum. The graph in Figure § plots competitive ratio 
C r (X, k)/C r (Opt, k) versus k for a number of paging strategies on a typical 
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Compet it iveness 




-Cache Size 



Figure 2: Competitiveness ( - nj^Pjj k ^ ) vs. k for typical r 



sequence 

We would like to keep the worst-case character of competitive analysis 
but somehow show more realistic competitive ratios. 

• A strategy X is loosely c(k)- competitive when, for all d > 0, for all 
n € J\f, for any request sequence r, only o(n) values of k in {1, . . . , n} 
satisfy 

C r (X,k) > max{c(/c) • C r (C-PT, A;),C r (OPT, l)/n d } + b, 

where b depends only on the starting configurations of X and Opt, 
and the o(n) is independent of r. 

That is, X is loosely c(fe)-competitive when, for any sequence, at all but a 
vanishing fraction of the values of k in any range {1, n}, either X is c(k)- 
competitive in the usual sense, or the cost to X with k servers is insignificant 



4 The input sequence, traced by Dick Sites }5A86 ], consists of 692,057 requests to 642 
distinct pages of 1024 bytes each. The sequence was generated by two X-windows network 
processes, a "make" (program compilation), and a disk copy running concurrently. The 
requests include data reads and writes and instruction fetches. 
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(or both). For instance, if a paging strategy is loosely 3 In In /c-competitive, 
then, for any fixed d > 0, on any sequence, for almost any choice of k in 
any range {1, 2, re}, the fault rate will be either at most l/n d or at most 
3 In In k times the minimum possible using a cache of size k. 

This model is realistic provided input sequences are not critically corre- 
lated with k and provided we are only concerned about being near-optimal 
when the cost is significant. Both criteria are arguably true for most paging 
applications. 

• A paging strategy is conservative if it moves no server from a node 
until all servers have been placed on the graph, and it moves servers 
at most k times during any consecutive subsequence requesting k or 
fewer distinct items. 



Any conservative 



Lru, Fifo, Mark, and even Fwf are conservative, 
paging strategy is fc _^ +1 -competitive [You91a]. 

The results we obtain follows: any conservative paging strat- 

egy is loosely c(/c)-competitive provided c(k)/\nk — > oo and both c(k) 
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and k/c{k) are non-decreasing; Mark is loosely c(fc)-competitive provided 
c(k) — 2 In In /c — > oo and both c(k) and 2 In k — c(k) are non-decreasing. 

Loose competitive ratios are thus shown to be exponentially lower than 
standard competitive ratios. 

Borodin, Irani, Raghavan and Scheiber [BIRS91| give a related work, in 
which the possible request sequences are quantified by the degree to which 
they exhibit a certain kind of locality of reference, and competitive ratios 
are considered as a function of this parameter. The work is extended by 
Irani, Karlin, and Phillips [IKP92|. The ratios shown in their model are, 
in most cases, much higher than the loose competitive ratios established in 
this paper. 



2 GreedyDual 

In this section we develop and analyse GreedyDual. We first develop a 
linear programming framework for the general A:-server problem, and then 
we present and analyze GreedyDual as a primal-dual algorithm within 
this framework. 

2.1 The A>server dual 

Fix a request sequence ro, r±, rjv, so that request % is to node r,. 

We next define IP, an integer linear program whose feasible solutions 
correspond to solutions of the £>server problem given by r. The variables 
of IP are {xij : < i < j < N}, where G {0, 1} is 1 if and only if the 
request served by the server of request j before serving request j is request 
i. 

After defining IP, we construct its fractional relaxation LP, and the dual 
DP of LP. 
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• IP(fc) (or just IP, if is determined by context) denotes the integer 
linear program 



minimize 



d ( r i,rj) 



0<i<j<N 



subject to < 



x(out(0)) 
x(out(i)) 
x(m(i)) 



< 



< 



k 



1 (1 < i < JV — 1) 

1 (1 < i < N) 

{0, 1} (0 < i < j < N) 



where out(z) denotes the set ■ i < j < N}, in(i) denotes the set 

{CM) : < j < i}, and x(S) = E(ij) € 5 x ij- 

For the weighted caching and paging problems (where initially no 
servers reside on the graph, and each server is allowed to serve its 
first request by being placed on the requested node at no cost), IP 
is defined as above, but we stipulate that request is to an artificial 
node that is never requested again and that is at distance to all 
later requests. With this stipulation the initial conditions for the gen- 
eral problem reduce to the initial conditions for weighted caching and 
paging. 

• LP(/c) (or just LP) denotes the relaxation of IP (obtained by replacing 
each constraint € {0, 1} with the constraint < Xij < 1). 



• || (a, b) \\h denotes the cost, — hao — J2i>i a i + J2i h, of a feasible solution 
to DP(h). 

Note that the dual constraints are independent of h, so that a dual solution 
is feasible independently of h. 

By duality, for any feasible dual solution (a, b), 



• BP(h) (or just DP) denotes the dual of LP(h): 



maximize — hao — a i + 2^ &i 

l<i<JV-l l<i<N 




C r (OPT,h) > \\(a,b)\\h 
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Incidentally, a standard transformation shows that IP is equivalent to a 
minimum- weight, bipartite, perfect matching problem^, and thus that LP 
has optimal integer solutions, so that, for given r and h, the above bound is 
tight for some (a, 6). 

2.2 The algorithm 

Here are the definitions and notations specific to GreedyDual: 

• A request has a server if the request has been served and the server 
has not subsequently served any other request. 

• The notation i~ denotes the most recent request (up to and including 
request i) that resulted in the server of request i moving. We define 
0" = 0. 

• S denotes the set {i : request i has a server.}. (More correctly, S is a 
multiset, as occurs in S once for each server on node rrj. Any i > 
can occur only once.) 

• (a, b) denotes a feasible dual solution maintained by GreedyDual. 

GreedyDual responds to each request as follows. If the requested node 
has a server, it does nothing. Otherwise, it uniformly raises a subset of the 
dual variables enough to account for the cost of moving some server, but not 
so much that feasibility is violated. It then moves a server whose movement 
cost can be accounted for. The full algorithm is given in Figure ||. 

2.3 Analysis of the algorithm 

A simple proof by induction on n shows that every b{ > that frj+i < 
w(rj) for i £ S (two facts that we use again later), and that the Relabel 

5 It may be useful for the reader in understanding LP to study the equivalent minimum- 
weight perfect matching problem, so we briefly outline it here. Construct a weighted 
bipartite graph G = (U,W,E), with U = {A , Ai, A N -i}, W = {Si, B N }, E = 
{{Ai, Bj) £ U x W : i < j}, and w(Ai, Bj) = d(ri, r 3 -). Each solution x to IP corresponds 
to the subset {e : x e = 1} of E. The cost of x equals the net weight of edges in the subset. 
Such subsets are exactly those such that every vertex in W touches one edge in the subset, 
every vertex in U except Aq touches at most one edge, and Ao touches at most k edges. 
We can leave the problem in this form, or we can convert it into a true perfect matching 
problem by duplicating Ao with its edges k times and adding k copies of a new node B^ 
with zero-cost edges from every Ai. 
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GREEDYDUAL(r,i) 

Comment: moves servers in response to requests ro, r\, rjv, maintain- 
ing (a,b), a dual solution, and S, a multiset containing the currently 
served requests, such that (a, b) is feasible and the distance traveled 
by servers is at most k _ k h+1 \\{a,b)\\ h - EiesV+i- 

In response to request 0: 

let ai_i <— £>j <— for i = 1, N 

let 5 <— the multiset containing request with multiplicity k 
In response to each subsequent request n > 0: 

if node r n has a server then 

Stay: choose i E S such that = r n 

let 5 <— S U {n} — {i}, satisfying request n 

else 

Relabel: Uniformly raise the dual variables in the set 

{a,i : < i < n - 1, i £ S} U {&; : 1 < i < n} 

so that (Vi G 5) < u;(r,) but (3i G 5) 6j- + i > u>(rj). 
Move: choose i G 5 such that 6j- + i > w(r^) 

let 5 <— 5 U {n} — {i}, satisfying request n 



Figure 4: GreedyDual primal-dual algorithm 

step can in fact be performed. All other steps can be seen to be well-defined 
by inspection, and clearly GreedyDual produces an appropriate sequence 
of server movements. This establishes the correctness of GreedyDual. 

To establish that GreedyDual is fc _^ +1 -competitive, we show two in- 
variants: that the dual solution (a, b) is feasible, and that the distance trav- 
eled by servers is bounded by fc _^ +1 \\(a, b)\\h — X^es&i-+i- Since every bi 
is nonnegative and \\(a, b)\\h is a lower bound on C r (OPT, h), this gives the 
result. 

Lemma 2.1 GreedyDual maintains the invariant that (a,b) is feasible. 
Proof: By induction on n. 
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Clearly (a, b) is initially feasible. 

The only step that changes (a, b) is Relabel. 

Clearly Relabel maintains that every Oj is nonnegative. 

Thus the only dual constraint that Relabel might violate is of the form 

bj - a,i < d(ri,rj) 

for some < i < j < N. 

By inspection of the Relabel step, such a constraint can only be vio- 
lated if i <G S and j < n. 

In this case, en = and r{ ^ rj because i has a server, so the constraint 
reduces to bj < w(ri). 

Since bi + \ < w{r,j) after the step, and bi + \ > bj (since j > i and we have 
already established that every bi > the constraint is maintained. rj 



Lemma 2.2 GreedyDual maintains the invariant that the net distance 
traveled by servers is bounded by 

Proof: By induction on n. 

Clearly the invariant is initially true. 

The Stay step leaves the net distance and the bound unchanged. 

The Relabel step also leaves the net distance and the bound unchanged. 
If ^ S, that the bound remains unchanged can be seen by inspecting the 
definition of ||(a, b)\\h, and noting that when the dual variables are raised, 
n — k of the a^'s, including ao, and n of the b^s increase. Consequently 
|| (a, b) \\h is increased by k — h + 1 times as much as any individual term, 
and, in the bound, the increase in the minuend exactly counterbalances the 
increase in the subtrahend. 

If G S, the bound remains unchanged because the constraint b\ < 
w(ro) = ensures that the raise is degenerate — that the dual variables are 
in fact unchanged. 

The Move step increases the distance traveled by w(ri), and increases 
the bound by — & n -+i- Since n~ = n, 6 ra +i = 0, and > w(ri), 

the bound is increased by at least w(ri), and the invariant is maintained. 

□ 
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Corollary 2.3 GreedyDual is k _^ +1 -competitive. 

Note that in order to implement GreedyDual, only the values L[sj\ = 
w(ri) — bi+i and H[si] = w(ri) — (for each server Si of a request i € S) 

need to be maintained, and that the artificial first request may be dropped, 
instead placing the servers on nodes when they first truly serve a request. 
We leave it to the reader to verify that these modifications lead to the direct 
description of GreedyDual given in Figure ffl. 



3 Loose Competitiveness 

In this section we give our analyses of loose competitiveness of paging strate- 
gies. The theorems and lemmas in this section, except as noted, first ap- 
peared in [You91b, You91a], 



Recall that a /c-server strategy is loosely c(k)- competitive if, for any d, for 
any n, for any request sequence r, only o(n) values of k £ {1, ...,n} satisfy 

Cr(X,k) > max{c(/c)C r (OPT,A;),C r (OPT,l)/n d } + b. 

The following terminology is essentially from Fiat et al.'s [FKL + 91, 



You91a, You91b| l analysis of the marking algorithm. Given a sequence 



r 



and a positive integer k, 



The k-phases of r are defined as follows. The first /c-phase is the 
maximum prefix of r containing requests to at most k distinct nodes. 
In general, the ith fe-phase is the maximum substring^] of r beginning 
with the request, if any, following the i — 1st fc-phase and containing 
requests to at most k distinct nodes. 

Thus the i + 1st A;-phase begins with the (new) request that would 
cause Fwf to flush its fast memory for the ith time. 

V r (k) denotes the number of /c-phases, minus 1. 

A new request (for a given k) in a /c-phase (other than the first) is a 
request to a node that is not requested previously in the fc-phase or in 
the previous fc-phase. Thus in two consecutive fc-phases, the number 
of distinct nodes requested is k plus the number of new requests in the 
second /c-phase. 



^By "substring" we mean a subsequence of consecutive items. 
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• N r {k) denotes the average number of new requests per /c-phase of r 
other than the first. 

Thus the total number of new requests in r for a given k is N r {k)-V r {k). 

Our analysis has two parts. In the first part (Theorem |3.2| ) we show that, 
for any sequence, few values of k yield both a large number of /c-phases and 
a low average number of new requests per /c-phase. 



In the second part, we show (Lemma 3.4) that, for the paging strategies 
that interested us, for a given sequence and k, the cost incurred by the 
strategy is proportional to the number of /c-phases, and the competitiveness 
is inversely related to the average number of new requests per /c-phase. 
Consequently (Corollary |3.5|), by the first part of the analysis, few values of 
k yield both a high cost and a high competitiveness. 

The key technical insight (Lemma [O]) for the first part of the analysis 
is that if, for a given k, the average number of new requests per /c-phase 
is low, then, for k' just slightly larger than k, the number of /c'-phases is a 
fraction of the number of /c-phases. 

Lemma 3.1 Fix a sequence r. For any k, and any k! > k + 2J\f r (k), 

V r {k') < - A V r {k). 

Proof: Let po, . . . ,pp r (k) denote the /c-phase partitioning of r. 

At least half (and thus at least \V r (k)/2~\) of the V r (k) /c-phases pi, ■ ■■,p-p r (k) 
have a number of new requests not exceeding 2M r {k). Denote these by 

Pin ■ ■ ■ >P«[7Mfc)/2i ' 

If we modify the /c-phase partitioning of r by joining p%-—\ and pi - for odd 
j, we obtain a coarser partitioning of r into at most T > r {k) — pP r (fc)/4] pieces. 
In the coarser partitioning, each piece resulting from a join references at most 
k + 2J\f r (k) < k' distinct nodes, while each pieces remaining unchanged from 
the /c-phase partioning references at most k <k! distinct nodes. 

If we now consider the /c'-phase partitioning, we find that each /c'-phase 
must contain the final request of at least one of the pieces in the coarser 
partition, because if a /c'-phase begins at or after the beginning of a subse- 
quence of requests to at most k + 2J\f r (k) distinct nodes, it will continue at 
least through the end of the subsequence. 

Thus Vr(k') < V r {k) - \V r {k)/A\ < \V r {k). □ 

From this we show that there are not too many values of k yielding both 
a low average number of new requests per /c-phase and a significant number 
of /c-phases: 
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K{ki) 


< 


M, 


ki+i — ki 


> 


2M, and 


Vr{h) 


> 





Theorem 3.2 For any e > 0, M > 0, and any sequence r, the number of k 
satisfying 

M r (k) < M and V r {k) > eV r (l) (1) 

is 0(M In 1). 
Proof: 

Let s be the number of k satisfying the condition. 

We can choose I = [s/[2M]] such k so that each chosen k differs from 
every other by at least 2M. Then we have 1 < k\ < < ■ • • < ki such that 
for each i 

(2) 
(3) 
(4) 

Then for any i , by (j2j) and (||), ki + \ > ki + 2N T {ki), so, by Lemma [O], 
V r {h+l) < {3/4)V r (ki). Inductively, V r {h) < (3/ %) l - 1 P r {l). 
This, and (|), imply (3/4)'- 1 > e, so 

\sf\2M~\~\ -1 = 1-1 < ln 4/3 
This implies the bound on s. 

This establishes the first part of the analysis. 

We begin the second part by showing that Opt's cost per fc-phase is at 
least proportional to the average number of new requests per fc-phase: 

Lemma 3.3 For arbitrary paging request sequence r, and arbitrary k, h > 0, 

Cr(OPT,h)/V r (k) > (k-h + M r (k))/2 (5) 

(We use only the case k = h, but prove the general case.) 
Proof: Let (1 < i < V r {k)) denote the number of new requests in the 
ith. fc-phase, so that M r {k)V r {k) = J2i>i m i- During the i — 1st and ith 
/c-phases, k + mi distinct nodes are referenced. Consequently, any strategy 
for r with h servers makes at least k + rrii — h server movements during the 
two phases. Thus the total cost for the strategy is at least 

max I J2(k - h + m 2i+ i),J2(. k ~ h + ™2i) > > (k - h + M r {k))V r {k) /2. 

I i>\ i>l 
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□ 

We next show that the strategies that interest us incur a cost propor- 
tional to ft times the number of ft-phases, and (using the above lemma) that 
the strategies have competitiveness inversely related to the average number 
of new requests per ft-phase: 

Lemma 3.4 Let X denote any conservative paging strategy. Let Mark 
denote the marking algorithm. Then 

V r {k) > C r (X,k)/k (6) 

Proof: Bound @ follows directly from the definition of conservativeness. 

Bound (0) follows from Bound (@) and Bound @ of Lemma O, applied 
with h = k. 

Finally, we prove Bound @. Fix a request sequence r, and let mi (1 < 
i < T-V(ft)) denote the number of new requests in the ith ft-phase. Fiat et 
al. |FKL+91| j showQ that C r (MARK, k) < ^ rrii(H k - H m + 1). 



Since H a -H b = Eta+i \ < fa = ln f > letting /(m) = m(l + ln A), 
C r (MARK, fc) < Ya f(mi)- Since / is convex, C r (MARK, ft) < V r {k) f(J\f r (k)). 
Applying Bound Q), C r (MARK, ft) < 2C r (OPT, ft) f (M r (k))/M r (k), which 
is equivalent to Bound ^. □ 



We have established (Theorem |3.2| ) that for any sequence there are few 
values of ft yielding many ft-phases and a low average number of new requests 
per ft-phase. 



We have established (Lemma 3.4) that for the strategies we are interested 



in, the cost they incur with ft servers on a sequence is proportional to the 

7 Mark moves a server chosen uniformly at random from those on nodes not yet 
requested in the current fc-phase. 

Briefly, the analysis of Mark classifies nonnew requests within a phase into repeat 
requests (to nodes already requested this phase) and old requests (to nodes requested in 
the previous phase but not yet in this phase); the expected cost for the ith old request is 
bounded by m/(k — i + 1) because at least k — m — i + 1 of the k — i + 1 nodes requested 
in the previous phase but not in this phase are served, each with equal probability. 
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number of fc-phases, while the competitiveness is inversely related to the 
average number of new requests per fc-phase. 

Finally, we combine the two parts to show that, for any sequence, there 
are few values of k for which our strategies incur high cost and high com- 
petitiveness. This establishes loose competitiveness. 

Corollary 3.5 Let X denote any conservative paging strategy andC : AA + — > 
7Z + a nondecreasing function. 

X is loosely c{k) -competitive provided that k/c(k) is nondecreasing and 

while Mark is loosely c(k)- competitive provided 2\nk—c(k) is nondecreasing 
and 

c(k) - 2 In In A; -> oo. (10) 

Proof: Let X denote either any conservative paging strategy, in which 
case we assume condition (g) and that k/c(k) is nondecreasing, or Mark, 
in which case we instead assume condition ( |l0| ) and that 21n/c — c(k) is 
nondecreasing. 

We show that, for any d > 0, n > 0, and request sequence r, the number 
of violators k £ {1, . . . , n} is o(n), where a violator is a A; such that 

C r (X,k) > m&x{c(k)C r (OPT,k),Cr(OPT,l)/n d }. 

Let A; be a violator. Then bound (0) implies 

P , m > > c -(°; + f> _ J_ n . (1) . 

Bound (|^) and the monotonicity of k/c(k) imply 

« . . C r (OPT,k) 2k 2n 
K(k) < 2k r \ I > < — <— 12 
C r (X,k) c(k) cyn) 

Since each violator k satisfies (|Tl|) and (|l^), by Theorem |3.2j , the number of 
violators is O ^lnn rf+1 ^ n/c(n)j. This is o(n) by assumption (||). 

If X = Mark, then bound (||) and the monotonicity of 2 In A; — c(k) 
imply, for each violator k, that 

, / 1 C r (MARK, k) 
N r {k) < fcexp 1 - - 



2 C r (OPT,/c) 

< fcexp(l -c(fc)/2) 

< nexp(l - c(n)/2), (13) 
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so that by bounds ([Tl]) and ([T^) and Theorem [T^ the number of violators 
is O (^(lnn d+1 ^j nexp(l — c(n)/2)J . This is o(n) by assumption (|lC|). 

□ 



4 Concluding Remarks 

We conclude in this section with comments about further avenues of re- 
search. 

Historically, the role of duality in solving optimization problems is well- 
explored: dual solutions are used to guide the construction of primal solu- 
tions and to certify optimality. For on-line problems such as the fc-server 
problem, duality can serve a similar role; the differences are that the solu- 
tions we seek are approximate, and that the problem we want to solve is 
on-line. For on-line problems, it seems natural to seek a sequence of closely 
related dual solutions, one for each prefix of the request sequence. 

For those interested in extending our approach to the general fc-server 
problem we give the following brief hints. Solutions with monotonic bi's are 
not sufficient to give good bounds: add constraints bi + \ < bi to the dual 
problem and reformulate the primal; in the new primal request sequences 
can be much cheaper than in the old. Raising all of the frj's is probably not 
a good idea: consider a request sequence with requests from two infinitely 
separate metric spaces; a bi should change only when a request is made to 
the metric space of r^. Finally, a promising experimental approach: if r 
is a worst-case sequence for a /^-competitive algorithm X, and the bound 
C r (X,k) < fe|| (a, b)\\h is sufficient to establish competitiveness, then (a, b) 
must be optimal; thus by examining optimal dual solutions for worst-case 
sequences, we may discover the special properties of the (generally non- 
optimal) dual solutions that we seek for such an analysis. A similar technique 
has been tried for potential functions, but in that case each experiment is 
much less informative: it reveals only a single number, not an entire dual 
solution. 

There is a suggestive similarity between potential function and primal- 
dual techniques |You91a ], Briefly, both can be viewed as transforming the 



costs associated with operations so that a sum of local inequalities gives the 
necessary global bound. This connection might yield some insight into the 
special nature of primal-dual analyses for on-line problems. 

Open questions remain concerning loose competitiveness for paging. In 
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[You91a], Theorem 3.2 is shown to be tight, and consequently the analysis 
of loose competitiveness for Fwf is shown to be tight. No lower bounds on 
the loose competitive ratios of Lru, Fifo, or Mark have been shown. 

Finally, two challenges: find a randomized algorithm for weighted caching 
that is better than /c-competitive, and show reduced loose competitiveness 
for a weighted-caching algorithm. A possible hint: the concept of "new re- 
quests" used in analyzing Mark and showing loose competitiveness of pag- 
ing strategies may be captured by an algorithm that mimics GreedyDual, 
but increases each bi at only half the rate that GreedyDual does, and in- 
creases each dj only as much as necessary to maintain the dual constraints. 
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